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ABSTRACT 

In  this  paper  a processor  utilization  model  for  a simplified  multi- 
processor computer  system  is  developed  using  a G/M/s/N  queueing  system 
model.  Jobs  are  assumed  to  arrive  according  to  a general  input  process, 
and  each  job  is  assigned  randomly  to  an  available  processor.  A finite 
capacity  input  buffer  is  used  if  no  processor  is  available.  The  mathe- 
matical model  is  based  on  the  busy  period  analysis,  and  two  utilization 
measures  are  derived: 

(1)  Processor  utilization  (the  fraction  of  processor  occupation 
time  during  a busy  cycle) , and 

(2)  System  utilization  (the  fraction  of  actual  utilization  time 
for  all  processors). 

Additionally,  an  approximate  arbitrary  time  state  probability  distribu- 
tion is  obtained  and  it  serves  as  the  basis  for  the  above  measures  in 
addition  to  others.  Further  approximations  enable  the  development  of  a 
computational  model  from  the  mathematical  model.  Experimentation  with 
the  computational  model  reveals  the  sensitivity  of  the  model  to  varia- 
bility in  the  arrival  process.  Comparison  of  2-processor  and  4-piocessor 
systems  from  the  operator  perspective  indicates  a qualified  preference 
for  the  behavior  of  the  2-processor  system.  This  preference  must  be 
carefully  interpreted  since  processor  costs,  the  increase  in  overhead 
with  an  increase  in  processors,  and  behavioral  variables  reflecting  the 
user  perspective  are  excluded. 

Keywords:  multiprocessor,  processor  utilization,  finite  buffer  capacity. 
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INTRODUCTION 

Systems  utilizing  two  or  more  interconnected  processing  units  in 
order  to  execute  two  or  more  different  programs  or  tasks  simultaneously 
[1,  p.  305]  are  known  as  multiprocessing  computer  systems.  Interest 
in  these  systems  began  quite  early.  A 1963  paper  by  Critchlow  [2] 
contains  some  44  references.  Both  General  Electric  (GE  645)  and 
Control  Data  (CDC  6500)  introduced  multiprocessor  systems  in  the  mid 
1960's;  however,  interest  in  these  systems  has  increased  markedly  in 
recent  years.  This  interest  has  been  stimulated  by  several  inter- 
related developments: 

(1)  the  reduction  in  the  cost  of  main  frames, 

(2)  the  rapid  emergence  of  the  mini-  and  micro-processors 
accompanied  by  the  perspective  of  distributed  computing 
systems,  and 

(3)  the  increased  motivation  for  sharing  of  resources  through 
computer  networks  using  high  speed  data  communications. 

The  recent  compilation  of  material  edited  by  Enslow  [16]  gives  a more 
precise  definition  of  a multiprocessor  system  and  explains  the  varia- 
tions in  design  and  topologies  among  such  systems.  Regardless  of  the 
topology  of  a multiprocessor  system,  such  as  a large  duplex  main  frame 
located  in  the  same  room,  a pipeline  or  array  processor  utilizing  ex- 
tensive concurrent  operations  or  a set  of  minicomputers  distributed 
geographically  and  organizationally  throughout  a major  firm,  efficiency 
is  still  a major  issue. 

Efficiency  is  a familiar  byword  to  operations  research  practitioners, 
but  concern  for  efficient  computation  among  computer  scientists  emerged 
as  recently  as  the  late  1960s.  Prior  to  that  time  the  challenge  took 
the  form  of  an  "existence  proof"  — making  the  system  run,  with  little 
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resources  available  to  even  assess  whether  it  runs  well.  The  realities 
of  the  economic  environment  has  forced  a much  different  perspective  in 
recent  years,  and  an  emphasis  on  performance  measures,  system  models, 
and  the  development  of  an  evaluation  methodology  is  apparent. 

The  research  described  here  explores  the  processor  utilization  in 
a multiprocessing  system.  We  are  careful  to  emphasize  that,  although 
efficiency  and  utilization  are  closely  related,  a distinct  difference 
between  the  two  must  be  recognized.  Efficiency  is  a measure  based  on 
the  use  of  a computer  system  resource,  such  as  a central  processing 
unit  (CPU),  in  the  processing  of  particular  user  tasks.  Utilization 
is  a measure  of  the  period  of  use  of  a resource  for  either  user  or 
system  tasks.  Consequently,  a resource  could  be  heavily  utilized 
but  very  inefficiently  used.  An  example  is  a CPU  in  which  the  operating 
system  tasks  demand  95  percent  of  the  processing  time.  We  believe  that 
a measure  of  efficient  use  of  a system  resource  must  include  utilization 
as  one  major  component. 

Within  the  limitations  of  processor  utilization  as  a behavioral 
measure,  we  investigate  the  reaction  to  different  job  arrival  distribu- 
tions and  compare  the  behavior  of  2-processor  and  4-processor  systems, 

A mathematical  model  of  a simplified  multiprocessor  system  is  constructed 
and,  by  employing  several  approximations,  we  develop  a computational 
model  to  enable  experimental  work. 


Previous  Modeling  Treatments 

Prior  models  of  performance  in  multiprocessing  systems  are  generally 


directed  toward  solving  one  of  two  problems:  (1)  the  testing  of  various 


scheduling  disciplines,  possibly  with  the  intent  of  identifying  the  most 
preferred  among  a set  of  alternatives  [4,5,6]  or  (2)  the  description  of 
the  memory  contention  (interference)  arising  from  the  use  of  a common 
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Figure  1 provides  a sketch  of  our  simplified  model  of  a multi- 
processing system.  Arriving  jobs  are  assigned  by  a "dispatcher"  im- 
mediately to  a "free"  processor.  If  all  processors  are  busy,  jobs  are 
assigned  to  an  input  buffer  with  finite  storage  capacity.  When  the 
input  buffer  is  saturated,  all  input  terminals  are  temporarily  blocked 
from  further  transmission  and  hence  arriving  jobs  as  assumed  "lost" 
at  such  times,  Jobs  are  assigned  to  available  processors  according 
to  an  appropriate  scheduling  rule,  and  all  jobs  are  processed  to  com- 
pletion without  interruption.  Internal  memory  capacity  is  assumed 
sufficient  for  all  job/processor  combinations.  Completed  jobs  are 
released  from  the  system,  and  once  a job  is  assigned  to  a processor 
all  remaining  input/output  functions  are  performed  by  that  processor 
(either  individually  or  allocated  to  an  I/O  processing  unit) . No  hard- 
ware or  software  failures  are  considered  since  we  are  investigating  the 
relationships  among  the  job  arrival  process,  the  input/buffer  capacity 
and  the  number  of  processors.  Channel  capacity  and  interference  problems 
are  also  ignored. 

From  a queue  theoretic  terminology,  it  must  be  noted  that  the  system 
being  considered  is  the  multiserver  finite  waiting  room  queue  with  recur- 
rent arrivals  and  exponential  service  times  (the  G/M/s/N  queue  in  Kendall 
notation).  In  the  following  sections  we  develop  an  approximate  but  com- 
putationally attractive  procedure  for  deriving  system  utilization  measures 
based  on  arbitrary  time  behavior  of  the  system  using  imbedded  Markov  chain 
analysis  techniques,  Keeping  in  mind  that  these  results,  though  general 
in  nature,  will  be  used  mainly  in  computer  system  contexts,  we  adopt  the 
latter  terminology  so  as  to  make  it  easily  accessible  to  the  applied 


! j 


I 


II 


researcher. 


MATHEMATICAL  MODEL 


Definitions  and  Assumptions 

A busy  period  (BP)  is  defined  as  the  time  interval  during  which 
the  system  (at  least  one  processor)  is  continuously  busy.  A busy  period 
followed  by  an  idle  period  (during  which  no  jobs  are  processed)  is  defined 
as  a busy  cycle  (BC),  Viewing  the  system  on  a time  axis,  it  appears  to 
go  through  a sequence  of  busy  cycles.  Under  stationary  conditions  and  in 
the  steady-state,  the  mean  value  characteristics  derived  for  the  busy 
cycle  represent  the  corresponding  mean  value  characteristics  for  the 
system  behavior. 

Depending  on  the  number  of  jobs  in  the  system  (assigned  to  processors 
and  in  the  input  buffer)  during  a busy  period,  one  or  more  processors  are 
utilized.  The  following  definitions  relate  to  the  utilization  of  pro- 
cessors during  a busy  cycle! 

Processor  occupation  time  is  defined  as  the  sum  of  all  non-idle 
(processing)  intervals  for  a single  processor  during  a busy 
period. 

System  occupation  time  is  defined  as  the  sum  of  all  intervals  during 

which  at  least  one  processor  is  busy.  It  corresponds  to  the 

busy  period  defined  above. 

Measures  of  utilization  are  defined  below: 

Processor  Utilization  (PU) 

processor  occupation  time  . , 

mean  length  of  busy  period 

System  Utilization  (SU) 

Actual  utilization  time  for  all  processors 

Potential  utilization  time  for  all  processors  in  a busy  period 


processor  occupation  time 
system  occupation  time 


(2) 
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These  measures  reflect  two  slightly  different  prospectlves  on  the 
system  behavior.  PU  is  a measure  of  the  long  term  use  of  the  pro- 
cessors comprising  the  system  relative  to  their  period  of  availability. 
Thus,  PU  is  affected  strongly  by  the  nature  of  the  demand  for  pro- 
cessing service.  While  SU  is  also  affected  by  the  processing 
demand,  it  reflects  the  demand  influence  to  a lesser  degree.  SU 
measures  the  degree  to  which  the  ful]  processing  capability  of  the 
system  is  utilized  during  periods  when  service  is  being  rendered. 

With  no  restrictions  on  the  utilization  of  individual  processors 
and  the  assumption  of  identical  processing  rate,  the  service  load  is 
equally  distributed  among  the  processprs.  Let  p be  the  offered  service 
load  per  processor,  defined  as 


arrival  rate 

P (number  of  processors)  x (processing  rate) 

and  p*  be  the  effective  service  load  per  processor  resulting  from  the 

I 

j finite  buffer  capacity.  Let  PB  be  the  probability  that  a job  encoun- 

ters a filled  buffer  on  its  arrival.  Then  we  have 


(arrival  rate)  x (1-PB) 

(number  of  processors  x (processing  rate) 


I 


= (l-PB)p,  (3) 

The  processor  utilization  PU  can  be  given  by  p or  p*  accordingly.  Also, 
let  pQ  be  the  probability  that  the  system  is  idle  in  the  long  run.  An 
expression  for  pQ  can  be  given  as 

system  idle  time  during  a busy  cycle 
Po  mean  length  of  busy  cycle 

1 system  occupation  time 

mean  length  of  busy  cycle 

The  two  utilization  measures  PU  and  SU  have  the  relation 


8 


SU  = 1 

PU  1-p 


Thus  the  determination  of  the  two  utilization  measures  requires 

either  the  knowledge  of  the  occupation  time  during  a busy  cycle  or  the 

probability  of  blocking  PB  and  the  probability  of  system  idleness  p . 

o 

The  value  PB  is  obtained  as  the  probability  of  a filled  buffer  in  an 

arrival  epoch  steady-state  distribution,  and  the.  probability  p is 

o 

obtained  as  the  probability  of  emptiness  in  an  arbitrary  time 
steady-state  distribution.  Except  in  some  special  cases,  the  rela- 
tionship between  the  two  probability  distributions  (the  arrival  epoch 
and  the  arbitrary  time)  is  not  exactly  known  (see,  Takacs  [14] , Chapter 
1,  for  the  known  relation  for  an  infinite  buffer  capacity);  therefore, 
the  information  provided  by  the  arrival  epoch  distribution  is  not 
complete.  Consequently,  we  develop  an  alternate  procedure  which 
provides  approximate  values  of  utilization  measures  indicative  of  the 
operator-oriented  perspective.  As  illustrated  in  a subsequent  section, 


the  method  is  also  convenient  for  computational  use.  Additionally, 
the  potential  for  application  of  this  method  to  different  operating 


system  policies  seems  excellent. 


The  following  assumptions  are  stated  with  regard  to  the  basic  char- 
acteristics of  the  system.  The  repetition  of  certain  points  from  the 


previous  section  is  simply  to  place  all  assumptions  in  one  section. 
(1)  There  are  s identical  parallel  processors  in  the  system. 
Processing  times  for  individual  jobs  follow  an  exponential 
distribution  with  mean  1/y  for  each  processor. 


(2)  The  sequence  of  time  epochs  t^,  t^,  t^.  ...  mark  the  arrivals 

of  jobs.  Let  Z = t - t , (n=l,2,...).  We  assume  that 
n n n-1 

the  sequence  of  random  variables  {Z^}  aie  distributed  as 
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P [Z  < t]  = A(t)  (t  > 0) 
n - — 

and 

E[Z  ] = a 
n 


The  random  variables  Z , n = 1,2,,,.  are  assumed  to  be 

n 

independent  and  identically  distributed  throughout  our 
discussion.  This  is  done  only  for  convenience  and  the 
extension  to  a state-dependent  random  variable  presents 
no  difficulty, 

(3)  The  system  has  an  input  buffer  with  capacity  for  (N-s) 
waiting  jobs. 

(4)  Let  be  the  number  of  jobs  in  the  system  just  before  an 

arrival  att  (n  = 1,2,...),  IfJ  <s,  tht  arriving  job  is 
n n 

randomly  assigned  by  the  dispatcher  to  an  available  processor. 

If  s < < N,  the  arriving  job  is  assigned  to  the  input 

buffer  to  await  processing.  If  J = N,  the  job  arrival 

n 

stream  is  disabled. 

(5)  Once  assigned  to  a processor,  the  job  is  completely  serviced 
and  exits  the  system.  Any  further  input/output  requirements  of 
the  job  are  accomplished  by  the  assigned  processor  (either  by 
that  processor  or  an  assigned  I/O  processor). 

(6)  Internal  memory  capacity,  whether  shared  or  dedicated,  is 
sufficient  for  any  combination  of  processing  tasks,  and  the 
requirements  on  any  other  resources  are  reflected  in  the  pro- 
cessing time  of  each  job. 


The  Processes  J and  J(t) 
n 


Based  on  the  above  assumptions  we  note  that  the  process  {J^}  is  a 
finite  Markov  chain  with  transition  probabilities  (i,j  = 0,1,2,...N) 
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such  that 


where 


dP  (x) 

Nj 


dP  (x) 

ij 


“i s’ l “V10 


N-i 

(sp-x)  dA(x) 

Tn-3T: 

x 

r -spy  N-s-1 


(s  5 j 2 N) 


dA(x)  J e~5|Xy  |1-  e 

o (n-s-d:  J 

(j  < s)  (4) 


e S>iX(s4x)1"^1  dA(x) 
(i-j+DI 


(s  < j < 1+1) 


dA(x)  f ”S^(sjijr)  1_Ssn(S)  [l-e"|l(x"y)]S~1 

l (i-s):  j 


-jp(x-y) 


fj1)  [l-e”UX]i_^+1  e"j4x, 


dA(x) 


(8  < i,  J < S) 


(i  < s,  j < 1 + 1) 


mile  using  (4)  and  (5)  to  obtain  a in  a convenient  form,  we  introduce  the 
notation 


oo 

f -6x  4 

Y (5)  = \ e (5x)J  dA(x) 

J J J t 

n J • 
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We  obtain  the  following  expressions 


YN-j(s,° 


W .1 B 


(j  < s) 


Yi-j+l(SU) 


(s  < j < i+l;i  > s-l) 


/ s \ Z l (-Dr  (8"J)(ill^)£_i+s_1  Yj,(sy) 
\s-jj  £=i-s+l  [r*0 


(s  < i,  j < s)  (8) 


(“)T™'  (Tk 


f(r+j)p] 


(i  < s,  j < i+1) 


During  a busy  period  transitions  of  the  Markov  chain  {J^}  occur  only  among 


{1,2, Since  {J  } represents  the  state  of  the  system  only  at  arrival 

nn-0 


epochs,  we  must  determine  the  number  of  visits  to  different  states  between  arrivals 


in  order  to  derive  the  processor  occupation  time.  Therefore,  let  J(t)  be  the 


number  of  jobs  in  the  system  at  time  t.  We  can  obtain  the  processor  and  system 


occupation  times  during  a busy  period  in  two  stages: 


(1)  Determine  the  expected  number  of  visits  of  {J^}  to  states  1,2,...,N 


during  a busy  period. 


(2)  Determine  the  occupation  time  of  the  process  J(t)  in  states 


1,2,...,N  for  every  visit  of  {J^}  to  a particular  state. 


Of  these,  the  first  stage  follows  directly  from  the  theory  of  finite  Markov  chains. 


Partition  the  transition  probability  matrix  P of  the  Markov  chain  {J  } as 

n 


follows. 
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10 
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• 

H 
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j 
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Let 


(I-H) 


-1 


^11 

v12  ... 

vlN 

v2i 

^22  ... 
• 

V2N 

•V» 

• 

• 

v • • • 

N2 

i I 

NNJ 

(10) 


From  the  theory  of  finite  Markov  chains  (see,  Kemeny  and  Snell  [17]) 
we  know  that  the  expected  number  of  visits  of  the  process  to  state  j 
during  a busy  period,  having  initiated  from  state  i,  is  given  by  v 


ij 


For  the  second  stage,  we  divide  the  transitions  occurring  between 


two  consecutive  arrival  epochs  into  two  cases:  (i)  J =j  and  J ,,*k 

n n+1 


(>  0)  and  (±i)  J =1  and  J .,=0,  In  these  two  cases,  approximate 
n ' n+i 


expressions  for  occupation  times  of  the  process  J(t)  can  be  obtained 
as  follows, 

i Case  1:  During  the  inter-arrival  interval,  J(t)  passes  through 

the  states  j+1,  j,  k.  The  unconditional  occupation  time  of  J(t) 

in  state  r (r=j+l,  ],  ,,,,  k)  is  1/ry  for  r<s  and  1/sy  for  r>s.  However, 
these  transitions  are  observed  during  an  interval  with  mean  length  a 
(the  expected  inter-arrival  interval) ; consequently,  the  conditional 
occupation  time  in  state  r is  obtained  as 

a/min(r.s) 


_ /_•>  a/ uui 

fcj(r)  min(j+l,N) 


Mil 

dkj 


Z [l/min(m,s)] 

m-k 

1 m 1,2, ... ,r;  j * r— l,r,...,N, 


(ID 


where  a(r)  and  dj.  are  used  to  represent  expressions  in  the  numerator  and 


i 

Li 


U 


-i 
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denominator  of  (11)  respectively.  The  above  expression  results  if 
we  assume  that  the  mean  length  of  the  inter-arrival  period  is  a ir- 
respective of  the  numbers  j and  k and  note  that  within  such  a period 
J(t)  is  a death  process.  In  such  a process  with  death  rate  y per 
job,  the  mean  occupation  time  in  state  r is  1/ry  for  r<s  and  1/sy 
for  r>s.  Thus,  when  the  process  J(t)  goes  through  a transition  (j+1) 

-*•  k,  with  probability  a , , the  fraction  of  time  state  r is  occupied 

J k 

is  obtained  as 


(l/min(r,s) ]/d 


Hence  the  expression  (11)  abive  is  derived. 

Case  (ii):  At  the  conclusion  of  a busy  period,  i.e.,  when  J =1, 

n 

Jn+1=0,  the  amount  of  time  required  for  first  passage  to  zero  is 
dependent  on  the  initial  state  j;  consequently,  the  arguments  used  in 
Case  (i)  to  obtain  the  state  occupation  times  do  not  hold.  Retreating 


to  basic  arguments,  we  denote  by  Y^  the  occupation  time  in  state  r. 
Note  that  the  distribution  of  Y is  a conditional  exponential  with 

r j+1 

parameter  ry  such  that  neither  Y not  . I.  Y exceed  the  length  of  the 

r i=l  1 j+i 

inter-arrival  period  Z , Let  c . (y)  be  the  p.d.f,  of  Y . , We  have 

n j i=l  i 

for  Z = z and  0 < y < z 
n 


Cj  (y)dy 


(j+l)[l-e“Wy]je  %dy 


0 < j < s 


W.-S+lxJ-s  sll-e-^y-^]8'1  e'^y^ydxdy 


s < j < N 


x-0  (j-s-1) ! 


lu 


Therefore  we  obtain  a more  complex  representation  of  the  conditional 
occupation  time  in  state  r. 


' • y 
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Vr) 


-min(r,s)yy 


r Ll 

0 l'V>* 


Qnin(r  ,s)Jydy 


dA(z) 


1 = r-l,r, 


Clearly,  the  evaluation  of  lj(r)  presents  some  difficulty.  For  the 

purposes  of  numerical  investigations  we  suggest  the  following  appro- 
% 

ximation  a^ (r)  using  unconditional  means.  Let 

1 min  (j+l,N) 

a'  = min  {—  ^ [l/min(i,s)  ] , a}  (13) 

1=0,1,. ...N. 


and  write 


a.(r) 


a'/min(r,s)  a.( 

a (r)  = J • ■ . sr  - . 

oj  min(N, j+1)  d , 

E [l/min(m,s)] 


where  the  numerator  of  (14)  is  denoted  as  a.(r).  Clearly  a . (r)  over- 

J oj 

estimates  a^ (r)  in  case  (ii).  However,  for  moderate  to  large  values  of 
j,  ctjQ  is  expected  to  be  very  close  to  zero,  and  the  effect  of  approxi- 
mation is  almost  assuredly  negligible. 


Processor  Utilization 

From  the  results  derived  in  the  last  section,  we  develop  expressions 
for  the  expected  state  occupation  times  E(Sr)  during  an  expected  busy 
cycle  E(BC).  By  definition 


E(BC) 


E E(S  ) ; 

r=0  r 


also,  the  mean  busy  cycle  can  be  derived  using  the  mean  first  passage 
times  (v^)  of  (10).  Noting  that  these  first  passages  are  conditional 
on  the  busy  cycle  extending  beyond  the  first  inter-arrival  period,  we 


E(BC)  = a[l  + y (y)  E v ] 

1=1 


write 


(16) 
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where  Yn(y)  ■ f° e ^XdA(x)  Is  the  probability  that  the  busy  cycle 
U 0 

extends  beyond  the  first  inter-arrival  period. 

Processor  utilization  requires  the  determination  of  individual 

state  occupation  times.  Let  E (S  ) be  the  occupation  time  of  state 

c r 

r conditional  on  the  busy  cycle  extending  beyond  one  transition 
interval.  Considering  the  number  of  visits  of  the  process  {Jr}  to 
different  states  and  the  state  occupation  times  of  the  process 
(J(t)}  between  transition  epochs,  we  obtain  (using  the  approximation 
suggested  earlier) 


“jk  \j(r)  + “jo  Vr) 


M»r>  - t VU  * «lk 

j=max(r-l, 1)  k=l  J 


E V11  a(r)  Z [ajk/dkj]  + a (r)  a /d 

j-max(r-l.l)  k=l  L j 17 


r = 1,2,3, (17) 


Removal  of  the  condition  on  state  occupation  times  results  in 


E(Sr)  = Y0(y)  Ec(Sr) 


r = 2,3, ...,N.  (18) 


The  expression  for  E(S^)  must  also  include  the  possibility  of  termination  of 
the  busy  cycle  with  only  one  service.  Thus  we  get 


I L 


E(S1)  = a0(l)(l-Y0(p)]  + [a  + Ec(S1)  1Y0(U)  (19) 

where  3^(1)  is  to  be  obtained  from  (12).  As  an  approximation  for  1^(1),  we 
may  use  aQ  given  by  (13  ) . 

Expressions  for  the  expected  processor  occupation  time  (E  ) and  the  expected 

pot 

system  occupation  time  (E  ) follow  directly 

sot 


s-1  N 

Z - E(S  ) + Z E(S  ) 


l m Z E(S  ) 
sot  , r 
r-1 


(21) 
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The  two  measures  of  utilization  suggested  earlier  can  be  given  as 


(1)  processor  utilization 


PU  * e(bc)  • and 

(2)  system  utilization 


SU  = -E^ 

E 

sot 

In  the  first  expression  E(BC)  is  obtained  as  in  (16),  Because  of  an 
approximation  in  (13)  and  (14),  when  the  mean  inter-arrival  time  is 

smaller  than  the  mean  processing  time  (which  is  possible  with  more  than 

% 

one  processor),  E(BC)  = E . Although  more  exact  evaluations  of  (12) 

sot 

are  possible  in  order  to  maintain  the  distinction  between  E(BC)  and 

E , we  feel  that  the  additional  information  that  can  be  derived  is 
sot 

not  justifiable  (especially  considering  the  likelihood  of  introducing 
error  in  computing  the  ratios  of  the  integrals) . 

As  a result  of  the  approximation  in  (12),  the  numerical  values 
obtained  here  slightly  overestimate  the  utilization  measures.  When 
the  arrival  rate  of  jobs  relative  to  their  processing  rate  is  low,  the 


blocking  probability  is  negligible  and  the  processor  utilization  PU 
is  very  close  to  p (defined  earlier).  Therefore,  as  a correction  for 


our  utilization  measure,  we  write 


PU  = min[E  /E(BC),  p] 
pot 


An  approximate  steady  state  distribution  of  the  number  of  jobs  in 
the  system  at  an  arbitrary  time  point  also  follows  easily  from  our 


results.  We  have 


E(Sr) 

pr  * E(BC)  ’ 

N 

p„  - 1 - ^ v 


r - 1,2,3, ...,N 
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Furthermore,  using  the  discussion  following  equation  (3)  we  determine  the  probability 


of  blocking  as 


PB  - 1 - PU/p 


COMPUTATIONAL  MODEL 


In  converting  the  mathematical  model  into  a computational  model 


we  have  approximated 


V-)  - r v — 

J Of  c (y)dy 


{ ye  mln^r,S^yXin(r,s)udy 


dA(z) 


j r*l ,r , . • • ,N 


by  a^  (r)  where 

af/min(r,s) 

a01  (r>  " £ 

J min(N,j+l) 

I [l/min(m,  s) ] 
m=l 


min(N, j+1) 

a'  - min  {1_  E [l/min(i,s)]  , a} 
-*  p i=l 


Three  other  aspects  of  the  computational  model  deserve  mention.  The 
first  involves  the  computation  of  the  values  y(&)  given  in  (6)  as 


y,(6)  = I*  e"6x(£x)£ 


dA(x) 


For  any  arrival  process  we  compute  all  values  j = 1,2,...,N  such  that 


with  a prescribed  error  bound. 

The  second  aspect  is  concerned  with  the  calculation  of  the  infinite 
sums  contained  in  (u)  and  (8),  e.g.  the  second  expression  in  (7). 
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°NJ 


( 8 ) 


n 

i 

A-N-s 


s“j  r S-1  s-1-r  *-*”■» 

E (-l)r  (srJ)(S-l-i) 
r-0 


ya(sw) 


where  n Is  such  that 


j < s 


s 

ls/2jJ 


Yn+l(sw)  <e2 


where  IxJ  Is  the  greatest  Integer  < x (the  floor  function). 

This  rather  simple  single  term  cutoff  is  justified  by  the  fact  that  the  Y^(sp) 
values  are  probabilities  and  are  strictly  monotone  non-increasing  with  increasing 
values  of  j . The  truncation  term  n is  computed  only  once  since  we  can  easily  show 
that  the  contribution  of  the  similar  subexpression  in  (8)  cannot  exceed 

The  final  aspect  of  the  computational  model  concerns  the  determination  of  the 
matrix  (I-H)  \ Observing  that  the  probability  transition  matrix  has  the  lower 
Hessenberg  structure,  i.e. 


00  “oi 

0 

l10  “ll  “l2 

• • 
• 

N-1,0  aN-l,l  ... 

°N-1,N 

NO  °N1  . ... 

aNN 

1 


we  use  a Gaussian  elimination  method  with  row  pivoting  for  solving  the  linear 
system  in  a very  efficient  manner. 

Empirical  results  are  obtained  from  FORTRAN  programs  developed  and  executed 
using  the  FTN  compiler  on  a CDC  6700  and  the  G-Level  FORTRAN  IV  compiler  on  a 
dual  IBM  S370/158  system.  All  programming  was  done  by  the  authors  except  for  the 
Gaussian  elimination  routine  provided  by  Professor  James  E.  Kalan. 


EXPERIMENTAL  RESULTS 


Experiments  with  the  model  focus  on  four  behavioral  variables: 

(1)  the  arbitrary  time  state  probability  distribution,  the 
determination  of  which  is  a major  oroblem  by  itself, 

(2)  the  expected  busy  cycle, 

(3)  the  processor  utilization  measure  PU  and 

(4)  the  system  utilization  measure  SU, 

The  measures  (2),  (3)  and  (4)  can  be  obtained  easily  from  (1);  yet 
each  offers  an  added  insight  into  the  total  behavior.  We  also  provide 
the  expected  number  of  jobs  in  the  input  buffer. 

Our  intent  is  to  determine  the  behavior  of  the  multiprocessor 
model  under  three  conditions: 

(1)  differing  variability  levels  in  the  job  inter-arrival  time 

distribution  — using  an  Erlangian  (k,X)  with  k = 2,  X = .125; 
k = 4,  X = .250  and  k = 8,  X = .500,  and  coefficient  of  varia- 
tion (C.  V.  = standard  deviation^  values  of  , 707  ,500  and  .350, 

mean  / 

respectively; 

(2)  increasing  demand  on  a system  with  a fixed  number  of  homogeneous 
processors,  each  having  an  identical  processing  rate;  and 

(3)  testing  the  relationship  between  the  number  of  processors  and 
individual  processor  capability  in  a homogeneous  multiprocessor 
system. 

Figures  and  tables  are  used  to  summarize  the  results. 

Tables  la  and  lb  illustrate  the  effect  of  the  variability  of  the 
inter-arrival  distribution  on  a system  with  a buffer  capacity  of  6 jobs 
(N-s=6).  Table  la  gives  results  for  a four  processor  system  and  Table 
lb  gives  results  for  a two  processor  system.  Keeping  the  offered  load 
per  processor  constant  three  cases  with  different  coefficient  of  varia- 
tions are  considered.  The  limited  numerical  results  presented  in  the 
tables  indicate  general  patterns  of  behavior  of  effectiveness  measures 


as  follows 


• - 4 

H - 10 

Erlanglan  Distribution 

PB 

E(BC) 

Average  Ho.  Jobs 

PU 

SU 

of  Intsr-arrlval  Tinas 

Probability* 

Expected 

In  Input  Buffer 

Processor 

System 

of  blocking 

Busy  Cycle 

Utilisation 

Utilization 

Offered  load  par  procasaor 

P - 1.25  , processing  rate  y 

- 0.0125 

1-  .125,  .*-2?  C.V.-  .707 

.2146 

15966 

3.99 

.9818 

.9824 

X-  .25<X  k-4  , C.V.-  .500 

.2108 

30612 

4.11 

.9865 

.9869 

X-  . 50(\  k-8  , C.V.-  .353 

.2088 

46412 

4.19 

.9890 

' .9893 

Offarad  load  par  processor 

p - 1.00  , processing  rate  y 

- 0.0156 

X-.125  ,.fc-  2,  C.V.-  .707 

.0758 

2399 

2.51 

.9242 

.9280 

X-.250  , k-  4,  C.V.-  .500 

.0717 

3296 

2.46 

.9283 

.9318 

X-.500  , fc-  8,  C.V.-  .353 

.0693 

. 

' 4028 

2.46 

.9307 

.9340 

Offered  load  par  processor 

p - 0.75  , processing  rate  y 

- 0.0208 

X-.125  ,.k-2  f C.V.-  .707 

.0 

447 

.81 

.7499 

.7806 

X-.250  , k-4  , C.V.-  .500 

.0010 

494 

.64 

.7492 

.7663 

X-.500  , k-8  , C.V.-  .353 

.0136 

528 

.54 

.7398 

.7575 

. 

Offered  load  per  processor 

P • 0.050  , pri 

ocesslng  rate  y 

- 0.0313 

X-  .125  , .k-  2 , C.V.-  .707 

.0 

119 

.10 

.5000 

.5730 

X-  .250  , k-4  , C.V.-  .500 

.0 

118 

.05 

.5000 

.5480 

X-  .500  # k-8  t C.V.-  .353  j 

.0201 

118 

.03 

.4899 

‘ .5344 

Table  la.  Behavior  with  Differing  Variability  Levels  in  the  Job  Inter-Arrival 
Time  Distribution 




s - 2 

H - 8 

1 

Erlanglan  Distribution 

PB 

E(BC) 

Average  No.  Jobs 

PO 

SO 

of  Inter-arrival  Tlasa 

Probability 

Expected 

In  Input  Buffer 

Processor 

Systea 

of  blocking 

Busy  Cycle 

Utilization 

Utilization 

Offered  load  per  procesaor 

0 “ 1,25  .'processing  rate  u 

■ 0.025 

X-.125  ,>  2,  C,V.-  ,707 

,2166 

1215 

4,05 

.9793 

.9858 

*■,250  , k-  4,  C.V.-  ,500 

.2134 

1687 

4.16 

.9833 

.9890 

1"  ,500  , k-  8,  C.V.-  ,353 

,2115 

2098 

4.23 

.9856 

.9907 

Offered  load  per  processor 

P - 1.00  , processing  rata  y 

- 0.0313 

A-  .125  , k-  2 , C,V.-  .707 

.0746 

296 

2.70 

.9254 

.9493 

A-  .250  , k-4  , C.V.-  .500 

.0757 

317 

2.65 

.9243 

.9505 

A- .500  , k-8,  C.V.-  .353 

.0762 

332 

2.63 

.9238 

.9513 

Offered  load  per  processor 

p - 0.75  , processing  rate  y 

- 0.0417 

A-.125  ,k-2  , C.V.-  .707 

.0 

91 

1.09 

.7499 

.8550 

X-.250  , k-4  , C.V.-  .500 

.0 

84 

0.88 

.7499 

.8398 

X-.500  , k-8  , C.V.-  .353 

.001 

81 

0.76 

.7492 

.8300 

Offered  load  per  processor 

0 - 0.50  , processing  rate  y 

- 0.0625 

A-  .125  ,,k-2  ? C.V,-  .707 

.0 

40 

0.23 

.5000 

.7141 

A-  .250  , k-4  , C.T.-  .500 

.0 

37 

0.14 

.5000 

.6845 

A- .500  , k-8  , C.V.-  .353 

.0 

35 

0.10 

.5000 

.6674 

i 

I Table  lb,  Behavior  with  Differing  Variability  Levels  in  the  Job  Inter-Arrival 

Time  Distribution 
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Figure  2.  The  Arbitrary  Time  State  Probability  Function  for 
Deterministic  Inter-arrival  Times  (T) . 


Comparative  Behavior  of  Two-  and  Four-Processor  Systems  under  Increasing  Demand. 
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to  the  4-processor  curve  for  T=80  suggests  that  perhaps  the  2-processor 
system  is  preferable  since  it  provides  roughly  analogous  behavior  under 
a heavier  demand.  The  values  for  the  blocking  probability  and  processor 
utilization  shown  in  Table  2 also  contribute  to  suggesting  a 2-processor 
system  as  preferable.  However,  two  significant  facts  have  not  been 
considered: 

(1)  the  buffer  usage  with  the  4-processor  system  is  slightly  less 
under  higher  demand,  giving  an  indication  that  a lower  buffer 
capacity  could  be  used  in  a 4-processor  system;  and 

(2)  most  importantly,  the  cost  differential  for  the  less  capable 
processors  comprising  the  4-processor  system  could  exceed  the 
factor  of  2 by  a considerable  amount. 

However,  we  recognize  that  a 4-processor  system  introduces  added  overhead,  not 
considered  in  the  model.  All  considered,  we  must  conclude  that  a general  advantage 
for  the  2-processor  system  cannot  be  based  on  the  derived  behavior. 

Note  the  directional  shifts  in  the  expected  busy  cycle  for  the  2-processor 
system  (T  » 80,  45,  40),  which  is  not  demonstrated  by  the  4-processor  system.  We 
suspect  that  this  difference  in  behavior  stems  from  the  longer  idle  periods  under 
low  demand  for  the  2-processor  system.  As  the  demand  increases,  the  idle  period 
exceeds  the  increase  in  the  busy  period  for  a brief  time.  Also,  note  the  tremen- 
dous increase  in  the  busy  cycle  for  the  2-processor  system  as  the  inter-arrival 
time  goes  from  15  to  10.  The  magnitude  of  this  jump  suggests  that  a 4-processor 
system  might  be  more  capable  of  adjusting  to  increasing  demand.  We  hesitate  to 
offer  this  conclusion  without  further  study. 

As  a final  point,  we  remind  the  reader  that  the  model  is  developed  from  the 
operator  perspective.  No  measures  reflecting  the  user  perspective,  such  as  response 
time,  are  included  as  behavioral  variables.  A complete  evaluation 
would  include  both  cost  figures  and  behavioral  variables  reflecting  the  user 


perspective. 
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SUMMARY  AND  CONCLUSIONS 

We  have  developed  a detailed  model  of  processor  utilization  in  a 
homogeneous  multiprocessor  computer  system.  The  model  assumes  a general 
input  process  and  an  exponential  processing  time  for  each  processor.  A 
finite  capacity  input  buffer  is  used  when  no  processor  is  available. 

The  modeling  approach  also  derives  an  approximate  arbitrary  time  state 
probability  distribution,  numerical  results  for  which  can  be  derived 
only  by  considerably  more  effort.  The  expected  busy  cycle  follows 
directly  from  the  arbitrary  time  state  probability  distribution,  and 
two  measures  of  processor  utilization  are  obtained.  A computational 
model,  requiring  several  approximations,  is  developed  from  the 
mathematical  model.  Experimentation  with  varying  input  distributions 
leads  to  the  following  conclusions: 

(1)  When  the  offered  load  per  processor  is  large,  a highly 
variable  input  process  (an  inter-arrival  time  with  high 
variance)  causes  smaller  busy  cycles,  average  number  of 

jobs  waiting  in  the  buffer,  and  processor  and  system  utiliza- 
tion; but  increased  probability  of  blocking. 

(2)  When  the  offered  load  per  processor  is  small  the  conclusions  of 
(1)  above  are  reversed, 

(3)  Strictly  from  the  operator  perspective  and  neglecting  processor 
costs  and  buffer  usage,  a system  with  two  processors,  each 
having  twice  the  processing  rate  of  single  processors  in  a 
4-processor  system,  gives  preferable  behavior. 
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In  this  paper  a processor  utilization  model  for  a simplified  multi-proces- 
sor  computer  system  is  developed  using  a G/M/s/N  queueing  system  model.  The 
mathematical  model  is  based  on  the  busy  period  analysis,  and  two  utilization 
measures  are  derived:  (1)  Processor  utilization  (the  fraction  of  processor 

occupation  time  during  a busy  cycle),  and  (2)  System  utilization  (the  fraction 
of  actual  utilization  time  for  all  processors) . Experimentation  with  the  com- 
putational model  reveals  the  sensitivity  of  the  model  to  variability  in  the 
arrival  process. K — 
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